Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 3051 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 238.5 KiB |
| Average record size in memory | 80.0 B |
Variable types
| Categorical | 2 |
|---|---|
| Numeric | 8 |
Calendar_Week has a high cardinality: 113 distinct values | High cardinality |
Paid_Views is highly correlated with Organic_Views and 1 other fields | High correlation |
Organic_Views is highly correlated with Paid_Views and 3 other fields | High correlation |
Google_Impressions is highly correlated with Division and 4 other fields | High correlation |
Email_Impressions is highly correlated with Division and 4 other fields | High correlation |
Facebook_Impressions is highly correlated with Google_Impressions and 2 other fields | High correlation |
Affiliate_Impressions is highly correlated with Division and 2 other fields | High correlation |
Overall_Views is highly correlated with Paid_Views and 1 other fields | High correlation |
Sales is highly correlated with Division and 3 other fields | High correlation |
Division is highly correlated with Google_Impressions and 3 other fields | High correlation |
Calendar_Week is uniformly distributed | Uniform |
Email_Impressions has unique values | Unique |
Reproduction
| Analysis started | 2022-09-20 05:13:18.396358 |
|---|---|
| Analysis finished | 2022-09-20 05:13:37.564478 |
| Duration | 19.17 seconds |
| Software version | pandas-profiling v3.3.0 |
| Download configuration | config.json |
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.0 KiB |
| Z | 226 |
|---|---|
| B | 113 |
| Y | 113 |
| X | 113 |
| W | 113 |
| Other values (21) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3051 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| Z | 226 | 7.4% |
| B | 113 | 3.7% |
| Y | 113 | 3.7% |
| X | 113 | 3.7% |
| W | 113 | 3.7% |
| V | 113 | 3.7% |
| U | 113 | 3.7% |
| T | 113 | 3.7% |
| S | 113 | 3.7% |
| R | 113 | 3.7% |
| Other values (16) | 1808 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| z | 226 | 7.4% |
| b | 113 | 3.7% |
| c | 113 | 3.7% |
| d | 113 | 3.7% |
| e | 113 | 3.7% |
| f | 113 | 3.7% |
| g | 113 | 3.7% |
| h | 113 | 3.7% |
| i | 113 | 3.7% |
| j | 113 | 3.7% |
| Other values (16) | 1808 |
Most occurring characters
| Value | Count | Frequency (%) |
| Z | 226 | 7.4% |
| B | 113 | 3.7% |
| C | 113 | 3.7% |
| D | 113 | 3.7% |
| E | 113 | 3.7% |
| F | 113 | 3.7% |
| G | 113 | 3.7% |
| H | 113 | 3.7% |
| I | 113 | 3.7% |
| J | 113 | 3.7% |
| Other values (16) | 1808 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3051 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Z | 226 | 7.4% |
| B | 113 | 3.7% |
| C | 113 | 3.7% |
| D | 113 | 3.7% |
| E | 113 | 3.7% |
| F | 113 | 3.7% |
| G | 113 | 3.7% |
| H | 113 | 3.7% |
| I | 113 | 3.7% |
| J | 113 | 3.7% |
| Other values (16) | 1808 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3051 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Z | 226 | 7.4% |
| B | 113 | 3.7% |
| C | 113 | 3.7% |
| D | 113 | 3.7% |
| E | 113 | 3.7% |
| F | 113 | 3.7% |
| G | 113 | 3.7% |
| H | 113 | 3.7% |
| I | 113 | 3.7% |
| J | 113 | 3.7% |
| Other values (16) | 1808 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3051 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Z | 226 | 7.4% |
| B | 113 | 3.7% |
| C | 113 | 3.7% |
| D | 113 | 3.7% |
| E | 113 | 3.7% |
| F | 113 | 3.7% |
| G | 113 | 3.7% |
| H | 113 | 3.7% |
| I | 113 | 3.7% |
| J | 113 | 3.7% |
| Other values (16) | 1808 |
| Distinct | 113 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.0 KiB |
| 1/6/2018 | 27 |
|---|---|
| 2/9/2019 | 27 |
| 8/10/2019 | 27 |
| 8/3/2019 | 27 |
| 7/27/2019 | 27 |
| Other values (108) |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.92920354 |
| Min length | 8 |
Characters and Unicode
| Total characters | 27243 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1/6/2018 |
|---|---|
| 2nd row | 1/13/2018 |
| 3rd row | 1/20/2018 |
| 4th row | 1/27/2018 |
| 5th row | 2/3/2018 |
Common Values
| Value | Count | Frequency (%) |
| 1/6/2018 | 27 | 0.9% |
| 2/9/2019 | 27 | 0.9% |
| 8/10/2019 | 27 | 0.9% |
| 8/3/2019 | 27 | 0.9% |
| 7/27/2019 | 27 | 0.9% |
| 7/20/2019 | 27 | 0.9% |
| 7/13/2019 | 27 | 0.9% |
| 7/6/2019 | 27 | 0.9% |
| 6/29/2019 | 27 | 0.9% |
| 6/22/2019 | 27 | 0.9% |
| Other values (103) | 2781 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 1/6/2018 | 27 | 0.9% |
| 1/13/2018 | 27 | 0.9% |
| 1/20/2018 | 27 | 0.9% |
| 1/27/2018 | 27 | 0.9% |
| 2/3/2018 | 27 | 0.9% |
| 2/10/2018 | 27 | 0.9% |
| 2/17/2018 | 27 | 0.9% |
| 2/24/2018 | 27 | 0.9% |
| 3/3/2018 | 27 | 0.9% |
| 3/10/2018 | 27 | 0.9% |
| Other values (103) | 2781 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 6102 | |
| 1 | 5400 | |
| 2 | 5211 | |
| 0 | 3807 | |
| 8 | 1944 | 7.1% |
| 9 | 1944 | 7.1% |
| 3 | 729 | 2.7% |
| 6 | 567 | 2.1% |
| 7 | 540 | 2.0% |
| 4 | 513 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 21141 | |
| Other Punctuation | 6102 | 22.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5400 | |
| 2 | 5211 | |
| 0 | 3807 | |
| 8 | 1944 | 9.2% |
| 9 | 1944 | 9.2% |
| 3 | 729 | 3.4% |
| 6 | 567 | 2.7% |
| 7 | 540 | 2.6% |
| 4 | 513 | 2.4% |
| 5 | 486 | 2.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 6102 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 27243 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 6102 | |
| 1 | 5400 | |
| 2 | 5211 | |
| 0 | 3807 | |
| 8 | 1944 | 7.1% |
| 9 | 1944 | 7.1% |
| 3 | 729 | 2.7% |
| 6 | 567 | 2.1% |
| 7 | 540 | 2.0% |
| 4 | 513 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27243 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 6102 | |
| 1 | 5400 | |
| 2 | 5211 | |
| 0 | 3807 | |
| 8 | 1944 | 7.1% |
| 9 | 1944 | 7.1% |
| 3 | 729 | 2.7% |
| 6 | 567 | 2.1% |
| 7 | 540 | 2.0% |
| 4 | 513 | 1.9% |
| Distinct | 2345 |
|---|---|
| Distinct (%) | 76.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15093.94166 |
| Minimum | 1 |
|---|---|
| Maximum | 518190 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 132.5 |
| Q1 | 537 |
| median | 2699 |
| Q3 | 17358 |
| 95-th percentile | 64468 |
| Maximum | 518190 |
| Range | 518189 |
| Interquartile range (IQR) | 16821 |
Descriptive statistics
| Standard deviation | 30785.88498 |
|---|---|
| Coefficient of variation (CV) | 2.039618655 |
| Kurtosis | 45.4428633 |
| Mean | 15093.94166 |
| Median Absolute Deviation (MAD) | 2560 |
| Skewness | 5.199006148 |
| Sum | 46051616 |
| Variance | 947770713.9 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 856 | 6 | 0.2% |
| 500 | 6 | 0.2% |
| 159 | 6 | 0.2% |
| 534 | 6 | 0.2% |
| 554 | 5 | 0.2% |
| 255 | 5 | 0.2% |
| 261 | 5 | 0.2% |
| 729 | 5 | 0.2% |
| 269 | 5 | 0.2% |
| 606 | 5 | 0.2% |
| Other values (2335) | 2997 |
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 2 | 1 | < 0.1% |
| 3 | 2 | |
| 4 | 2 | |
| 5 | 1 | < 0.1% |
| 6 | 2 | |
| 10 | 1 | < 0.1% |
| 11 | 2 | |
| 12 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 518190 | 1 | |
| 361081 | 1 | |
| 321082 | 1 | |
| 320747 | 1 | |
| 258535 | 1 | |
| 252545 | 1 | |
| 237467 | 1 | |
| 210475 | 1 | |
| 200144 | 1 | |
| 195738 | 1 |
| Distinct | 2553 |
|---|---|
| Distinct (%) | 83.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13355.67322 |
| Minimum | 1 |
|---|---|
| Maximum | 270453 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 144 |
| Q1 | 712.5 |
| median | 4110 |
| Q3 | 16230.5 |
| 95-th percentile | 54600 |
| Maximum | 270453 |
| Range | 270452 |
| Interquartile range (IQR) | 15518 |
Descriptive statistics
| Standard deviation | 24079.39969 |
|---|---|
| Coefficient of variation (CV) | 1.802934176 |
| Kurtosis | 25.8861535 |
| Mean | 13355.67322 |
| Median Absolute Deviation (MAD) | 3814 |
| Skewness | 4.232216941 |
| Sum | 40748159 |
| Variance | 579817489.4 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 273 | 6 | 0.2% |
| 376 | 6 | 0.2% |
| 739 | 5 | 0.2% |
| 976 | 5 | 0.2% |
| 963 | 5 | 0.2% |
| 992 | 5 | 0.2% |
| 697 | 5 | 0.2% |
| 927 | 5 | 0.2% |
| 157 | 5 | 0.2% |
| 857 | 4 | 0.1% |
| Other values (2543) | 3000 |
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 3 | 2 | |
| 4 | 1 | < 0.1% |
| 5 | 3 | |
| 8 | 2 | |
| 9 | 3 | |
| 10 | 2 | |
| 11 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 270453 | 1 | |
| 246839 | 1 | |
| 245612 | 1 | |
| 243245 | 1 | |
| 201438 | 1 | |
| 199223 | 1 | |
| 198041 | 1 | |
| 195013 | 1 | |
| 190041 | 1 | |
| 181024 | 1 |
| Distinct | 2913 |
|---|---|
| Distinct (%) | 95.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 886173.8309 |
| Minimum | 7 |
|---|---|
| Maximum | 17150439 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.0 KiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 265 |
| Q1 | 169828 |
| median | 490531 |
| Q3 | 1022621.5 |
| 95-th percentile | 3261280 |
| Maximum | 17150439 |
| Range | 17150432 |
| Interquartile range (IQR) | 852793.5 |
Descriptive statistics
| Standard deviation | 1355075.817 |
|---|---|
| Coefficient of variation (CV) | 1.529130933 |
| Kurtosis | 33.24804215 |
| Mean | 886173.8309 |
| Median Absolute Deviation (MAD) | 386893 |
| Skewness | 4.555611693 |
| Sum | 2703716358 |
| Variance | 1.83623047 × 1012 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 196 | 4 | 0.1% |
| 200 | 4 | 0.1% |
| 683 | 4 | 0.1% |
| 110 | 4 | 0.1% |
| 184 | 3 | 0.1% |
| 234 | 3 | 0.1% |
| 542 | 3 | 0.1% |
| 668 | 3 | 0.1% |
| 226 | 3 | 0.1% |
| 516 | 3 | 0.1% |
| Other values (2903) | 3017 |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 8 | 2 | |
| 11 | 2 | |
| 16 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 22 | 3 | |
| 25 | 1 | < 0.1% |
| 28 | 3 |
| Value | Count | Frequency (%) |
| 17150439 | 1 | |
| 16420655 | 1 | |
| 16001714 | 1 | |
| 14455623 | 1 | |
| 13994100 | 1 | |
| 12556735 | 1 | |
| 10735172 | 1 | |
| 10645783 | 1 | |
| 10141086 | 1 | |
| 10116282 | 1 |
| Distinct | 3051 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 760509.3778 |
| Minimum | 40894.44732 |
|---|---|
| Maximum | 7317730.249 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.0 KiB |
Quantile statistics
| Minimum | 40894.44732 |
|---|---|
| 5-th percentile | 133318.5653 |
| Q1 | 378496.9247 |
| median | 590970.802 |
| Q3 | 962246.6105 |
| 95-th percentile | 1895692.071 |
| Maximum | 7317730.249 |
| Range | 7276835.802 |
| Interquartile range (IQR) | 583749.6858 |
Descriptive statistics
| Standard deviation | 626014.1235 |
|---|---|
| Coefficient of variation (CV) | 0.8231510904 |
| Kurtosis | 11.20751273 |
| Mean | 760509.3778 |
| Median Absolute Deviation (MAD) | 261356.6528 |
| Skewness | 2.587596985 |
| Sum | 2320314112 |
| Variance | 3.918936829 × 1011 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 349895.0107 | 1 | < 0.1% |
| 1061981.232 | 1 | < 0.1% |
| 181384.2373 | 1 | < 0.1% |
| 564751.8348 | 1 | < 0.1% |
| 206812.4045 | 1 | < 0.1% |
| 163574.101 | 1 | < 0.1% |
| 235901.3005 | 1 | < 0.1% |
| 169795.3193 | 1 | < 0.1% |
| 889099.7984 | 1 | < 0.1% |
| 1286652.231 | 1 | < 0.1% |
| Other values (3041) | 3041 |
| Value | Count | Frequency (%) |
| 40894.44732 | 1 | |
| 42547.90503 | 1 | |
| 52290.50056 | 1 | |
| 52430.51882 | 1 | |
| 54682.40674 | 1 | |
| 55208.45833 | 1 | |
| 55803.78371 | 1 | |
| 55964.30824 | 1 | |
| 56325.09614 | 1 | |
| 56351.78843 | 1 |
| Value | Count | Frequency (%) |
| 7317730.249 | 1 | |
| 5160763.736 | 1 | |
| 5153551.869 | 1 | |
| 5049751.178 | 1 | |
| 4723940.216 | 1 | |
| 4653501.774 | 1 | |
| 4106322.861 | 1 | |
| 4088594.141 | 1 | |
| 4085227.767 | 1 | |
| 4071753.83 | 1 |
| Distinct | 3036 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 269126.8879 |
| Minimum | 29 |
|---|---|
| Maximum | 7558435 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.0 KiB |
Quantile statistics
| Minimum | 29 |
|---|---|
| 5-th percentile | 13335.5 |
| Q1 | 57073.5 |
| median | 127523 |
| Q3 | 283505 |
| 95-th percentile | 963280 |
| Maximum | 7558435 |
| Range | 7558406 |
| Interquartile range (IQR) | 226431.5 |
Descriptive statistics
| Standard deviation | 466511.6667 |
|---|---|
| Coefficient of variation (CV) | 1.733426453 |
| Kurtosis | 57.61272328 |
| Mean | 269126.8879 |
| Median Absolute Deviation (MAD) | 87736 |
| Skewness | 5.988878784 |
| Sum | 821106135 |
| Variance | 2.176331352 × 1011 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 30780 | 2 | 0.1% |
| 82178 | 2 | 0.1% |
| 78489 | 2 | 0.1% |
| 125550 | 2 | 0.1% |
| 251488 | 2 | 0.1% |
| 453 | 2 | 0.1% |
| 5895 | 2 | 0.1% |
| 30140 | 2 | 0.1% |
| 223006 | 2 | 0.1% |
| 21462 | 2 | 0.1% |
| Other values (3026) | 3031 |
| Value | Count | Frequency (%) |
| 29 | 1 | |
| 79 | 1 | |
| 88 | 1 | |
| 122 | 1 | |
| 134 | 1 | |
| 139 | 1 | |
| 141 | 1 | |
| 218 | 1 | |
| 248 | 1 | |
| 323 | 1 |
| Value | Count | Frequency (%) |
| 7558435 | 1 | |
| 6830232 | 1 | |
| 6232118 | 1 | |
| 4599296 | 1 | |
| 4405999 | 1 | |
| 4313221 | 1 | |
| 4287559 | 1 | |
| 3994617 | 1 | |
| 3888224 | 1 | |
| 3759476 | 1 |
| Distinct | 2936 |
|---|---|
| Distinct (%) | 96.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22910.90265 |
| Minimum | 910 |
|---|---|
| Maximum | 175791 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.0 KiB |
Quantile statistics
| Minimum | 910 |
|---|---|
| 5-th percentile | 3237 |
| Q1 | 9127 |
| median | 16658 |
| Q3 | 27486.5 |
| 95-th percentile | 69527 |
| Maximum | 175791 |
| Range | 174881 |
| Interquartile range (IQR) | 18359.5 |
Descriptive statistics
| Standard deviation | 21617.6375 |
|---|---|
| Coefficient of variation (CV) | 0.9435524136 |
| Kurtosis | 6.882995576 |
| Mean | 22910.90265 |
| Median Absolute Deviation (MAD) | 8472 |
| Skewness | 2.316177224 |
| Sum | 69901164 |
| Variance | 467322250.9 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 8954 | 3 | 0.1% |
| 12533 | 3 | 0.1% |
| 13557 | 2 | 0.1% |
| 18082 | 2 | 0.1% |
| 23493 | 2 | 0.1% |
| 23566 | 2 | 0.1% |
| 33545 | 2 | 0.1% |
| 25898 | 2 | 0.1% |
| 23579 | 2 | 0.1% |
| 17651 | 2 | 0.1% |
| Other values (2926) | 3029 |
| Value | Count | Frequency (%) |
| 910 | 1 | |
| 912 | 1 | |
| 1055 | 1 | |
| 1062 | 1 | |
| 1089 | 1 | |
| 1139 | 1 | |
| 1247 | 1 | |
| 1285 | 1 | |
| 1309 | 1 | |
| 1317 | 1 |
| Value | Count | Frequency (%) |
| 175791 | 1 | |
| 156410 | 1 | |
| 152342 | 1 | |
| 143656 | 1 | |
| 143640 | 1 | |
| 141040 | 1 | |
| 136956 | 1 | |
| 134450 | 1 | |
| 131827 | 1 | |
| 131055 | 1 |
| Distinct | 2601 |
|---|---|
| Distinct (%) | 85.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27980.91413 |
| Minimum | 2 |
|---|---|
| Maximum | 635057 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.0 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 164.5 |
| Q1 | 747 |
| median | 7879 |
| Q3 | 34111.5 |
| 95-th percentile | 115101 |
| Maximum | 635057 |
| Range | 635055 |
| Interquartile range (IQR) | 33364.5 |
Descriptive statistics
| Standard deviation | 52054.97669 |
|---|---|
| Coefficient of variation (CV) | 1.860374413 |
| Kurtosis | 25.62023576 |
| Mean | 27980.91413 |
| Median Absolute Deviation (MAD) | 7564 |
| Skewness | 4.212281555 |
| Sum | 85369769 |
| Variance | 2709720598 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 776 | 5 | 0.2% |
| 491 | 5 | 0.2% |
| 869 | 5 | 0.2% |
| 887 | 5 | 0.2% |
| 919 | 5 | 0.2% |
| 775 | 5 | 0.2% |
| 392 | 5 | 0.2% |
| 988 | 5 | 0.2% |
| 993 | 4 | 0.1% |
| 804 | 4 | 0.1% |
| Other values (2591) | 3003 |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 2 | |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 3 | |
| 8 | 2 | |
| 11 | 2 | |
| 12 | 1 | < 0.1% |
| 13 | 4 |
| Value | Count | Frequency (%) |
| 635057 | 1 | |
| 563991 | 1 | |
| 462512 | 1 | |
| 446589 | 1 | |
| 438477 | 1 | |
| 436507 | 1 | |
| 432585 | 1 | |
| 388322 | 1 | |
| 386044 | 1 | |
| 381029 | 1 |
| Distinct | 3030 |
|---|---|
| Distinct (%) | 99.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 185901.3966 |
| Minimum | 15436 |
|---|---|
| Maximum | 3575430 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 24.0 KiB |
Quantile statistics
| Minimum | 15436 |
|---|---|
| 5-th percentile | 44684 |
| Q1 | 73393.5 |
| median | 113573 |
| Q3 | 202975.5 |
| 95-th percentile | 588421.5 |
| Maximum | 3575430 |
| Range | 3559994 |
| Interquartile range (IQR) | 129582 |
Descriptive statistics
| Standard deviation | 232207.9011 |
|---|---|
| Coefficient of variation (CV) | 1.249091752 |
| Kurtosis | 54.06561259 |
| Mean | 185901.3966 |
| Median Absolute Deviation (MAD) | 49231 |
| Skewness | 5.622929513 |
| Sum | 567185161 |
| Variance | 5.392050934 × 1010 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 73713 | 2 | 0.1% |
| 116050 | 2 | 0.1% |
| 72504 | 2 | 0.1% |
| 76308 | 2 | 0.1% |
| 101756 | 2 | 0.1% |
| 182603 | 2 | 0.1% |
| 76957 | 2 | 0.1% |
| 72269 | 2 | 0.1% |
| 74590 | 2 | 0.1% |
| 59974 | 2 | 0.1% |
| Other values (3020) | 3031 |
| Value | Count | Frequency (%) |
| 15436 | 1 | |
| 18432 | 1 | |
| 18440 | 1 | |
| 19899 | 1 | |
| 20377 | 1 | |
| 20822 | 1 | |
| 20849 | 1 | |
| 21092 | 1 | |
| 21596 | 1 | |
| 21772 | 1 |
| Value | Count | Frequency (%) |
| 3575430 | 1 | |
| 3561292 | 1 | |
| 3322758 | 1 | |
| 2424124 | 1 | |
| 2363272 | 1 | |
| 1897738 | 1 | |
| 1859541 | 1 | |
| 1791713 | 1 | |
| 1756387 | 1 | |
| 1712792 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| Division | Calendar_Week | Paid_Views | Organic_Views | Google_Impressions | Email_Impressions | Facebook_Impressions | Affiliate_Impressions | Overall_Views | Sales | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | A | 1/6/2018 | 392 | 422 | 408 | 349895.0107 | 73580 | 12072 | 682 | 59417 |
| 1 | A | 1/13/2018 | 787 | 904 | 110 | 506270.2176 | 11804 | 9499 | 853 | 56806 |
| 2 | A | 1/20/2018 | 81 | 970 | 742 | 430042.1538 | 52232 | 17048 | 759 | 48715 |
| 3 | A | 1/27/2018 | 25 | 575 | 65 | 417745.6658 | 78640 | 10207 | 942 | 72047 |
| 4 | A | 2/3/2018 | 565 | 284 | 295 | 408505.8012 | 40561 | 5834 | 658 | 56235 |
| 5 | A | 2/10/2018 | 256 | 330 | 683 | 434729.7550 | 36750 | 8469 | 691 | 56347 |
| 6 | A | 2/17/2018 | 886 | 56 | 664 | 634432.9117 | 112489 | 8331 | 685 | 81604 |
| 7 | A | 2/24/2018 | 336 | 99 | 470 | 555036.3088 | 218 | 6319 | 569 | 80492 |
| 8 | A | 3/3/2018 | 305 | 209 | 501 | 423690.0837 | 13065 | 7898 | 772 | 61804 |
| 9 | A | 3/10/2018 | 955 | 283 | 609 | 471730.0390 | 84449 | 8428 | 833 | 64944 |
Last rows
| Division | Calendar_Week | Paid_Views | Organic_Views | Google_Impressions | Email_Impressions | Facebook_Impressions | Affiliate_Impressions | Overall_Views | Sales | |
|---|---|---|---|---|---|---|---|---|---|---|
| 3041 | Z | 12/28/2019 | 8438 | 17143 | 598200 | 3.897592e+05 | 111386 | 8484 | 24536 | 120823 |
| 3042 | Z | 1/4/2020 | 33345 | 21075 | 306532 | 4.496571e+05 | 110314 | 9485 | 53762 | 132942 |
| 3043 | Z | 1/11/2020 | 8568 | 25140 | 289894 | 5.970322e+05 | 149930 | 9836 | 33988 | 94164 |
| 3044 | Z | 1/18/2020 | 17725 | 23274 | 327776 | 5.656911e+05 | 158896 | 17501 | 40339 | 104771 |
| 3045 | Z | 1/25/2020 | 23817 | 22134 | 560621 | 4.684737e+05 | 123430 | 13474 | 44967 | 77487 |
| 3046 | Z | 2/1/2020 | 29239 | 25311 | 622406 | 1.459071e+06 | 45026 | 12098 | 53667 | 82707 |
| 3047 | Z | 2/8/2020 | 26230 | 28031 | 624409 | 5.342505e+05 | 227070 | 9548 | 53665 | 84503 |
| 3048 | Z | 2/15/2020 | 24749 | 31281 | 439362 | 4.227182e+05 | 393685 | 9861 | 55561 | 147325 |
| 3049 | Z | 2/22/2020 | 20713 | 30356 | 464178 | 6.085799e+05 | 424676 | 10221 | 49221 | 111525 |
| 3050 | Z | 2/29/2020 | 15990 | 26993 | 449032 | 4.390165e+05 | 161439 | 10294 | 42994 | 98187 |